Learning with errors

Learning with errors (LWE) is a problem in machine learning. A generalization of the parity learning problem, it has recently[1][2] been used to create public-key cryptosystems based on worst-case hardness of some lattice problems. The problem was introduced[1] by Oded Regev in 2005.

Given access to samples (x,y) where x\in \mathbb{Z}_q^n and y \in \mathbb{Z}_q, with assurance that, for some fixed linear function f:\mathbb{Z}_q^n \rightarrow \mathbb{Z}_q , y=f(x) with high probability and deviates from it according to some known noise model, the algorithm must be able to recreate f or some close approximation of it.

Contents

Definition

Denote by \mathbb{T}=\mathbb{R}/\mathbb{Z} the additive group on reals modulo one. Denote by A_{s,\phi} the distribution on \mathbb{Z}_q^n \times \mathbb{T} obtained by choosing a vector \mathbf{a}\in \mathbb{Z}_q^n uniformly at random, choosing e according to a probability distribution \phi on \mathbb{T} and outputting (\mathbf{a},\langle \mathbf{a},s \rangle /q %2B e) for some fixed vector \mathbf{s} \in \mathbb{Z}_q^n where the division is done in the field of reals, and the addition in \mathbb{T}.

The learning with errors problem LWE_{q,\phi} is to find s \in \mathbb{Z}_q^n, given access to polynomially many samples of choice from A_{s,\phi}.

For every \alpha > 0, denote by D_\alpha the one-dimensional Gaussian with density function D_\alpha(x)=\rho_\alpha(x)/\alpha where \rho_\alpha(x)=e^{-\pi(|x|/\alpha)^2}, and let \Psi_\alpha be the distribution on \mathbb{T} obtained by considering D_\alpha modulo one. The version of LWE considered in most of the results would be LWE_{q,\Psi_\alpha}

Decision version

The LWE problem described above is the search version of the problem. In the decision version, the goal is to distinguish between noisy inner products and uniformly random samples from \mathbb{Z}_q^n \times \mathbb{T} (practically, some discretized version of it). Regev[1] showed that the decision and search versions are equivalent when q is a prime bounded by some polynomial in n.

Solving decision assuming search

Intuitively, it is easy to see that if we have a procedure for the search problem, the decision version can be solved easily: just feed the input samples for the decision version to the procedure for the search problem, and check if the s returned by the search procedure can generate the input pairs modulo some noise.

Solving search assuming decision

For the other direction, suppose that we have a procedure for the decision problem, the search version can be solved as follows: Recover the s one co-ordinate at a time. Suppose we are guessing the first co-ordinate, and we are trying to test if s_i=k for a fixed k\in Z_q. Choose a number l \in \mathbb{Z}^*_q uniformly at random. Denote the given samples by \{(a_i,b_i)\} \subset \mathbb{Z}^n_q \times \mathbb{T}. Feed the transformed pairs \{(a_i%2B(l,0,\ldots,0),b_i%2B(lk)/q)\} to the decision problem. It is easy to see that if the guess k was correct, the transformation takes the distribution A_{s,\chi} to itself, and otherwise takes it to the uniform distribution. Since we have a procedure for the decision version which distinguishes between these two types of distributions, and errs with very small probability, we can test if the guess k equals the first co-ordinate. Since q is a prime bounded by some polynomial in n, k can only take polynomially many values, and each co-ordinate can be efficiently guessed with high probability.

Hardness results

Regev's result

For a n-dimensional lattice L, let smoothing parameter \eta_\epsilon(L) denote the smallest s such that \rho_{1/s}(L^*\setminus \{\mathbf{0}\}) \leq \epsilon where L^* is the dual of L and \rho_\alpha(x)=e^{-\pi(|x|/\alpha)^2} is extended to sets by summing over function values at each element in the set. Let D_{L,r} denote the discrete Gaussian distribution on L of width r for a lattice L and real r>0. The probability of each x \in L is proportional to \rho_r(x).

The discrete Gaussian sampling problem(DGS) is defined as follows: An instance of DGS_\phi is given by an n-dimensional lattice L and a number r \geq \phi(L). The goal is to output a sample from D_{L,r}. Regev shows that there is a reduction from GapSVP_{100\sqrt{n}\gamma(n)} to DGS_{\sqrt{n}\gamma(n)/\lambda(L^*)} for any function \gamma(n).

Regev then shows that there exists an efficient quantum algorithm for DGS_{\sqrt{2n}\eta_\epsilon(L)/\alpha} given access to an oracle for LWE_{q,\Psi_\alpha} for integer q and \alpha \in (0,1) such that \alpha q > 2\sqrt{n}. This implies the hardness for LWE. Although the proof of this assertion works for any q, for creating a cryptosystem, the q has to be polynomial in n.

Peikert's result

Peikert proves[2] that there is a probabilistic polynomial time reduction from the GapSVP_{\zeta,\gamma} problem in the worst case to solving LWE_{q,\Psi_\alpha} using poly(n) samples for parameters \alpha \in (0,1), \gamma(n)\geq n/(\alpha \sqrt{\log{n}}), \zeta(n) \geq \gamma(n) and q \geq (\zeta/\sqrt{n}) \omega \sqrt{\log{n}}).

Use in Cryptography

The LWE problem serves as a versatile problem used in construction of several[1][2][3][4] cryptosystems. In 2005, Regev[1] showed that the decision version of LWE is hard assuming quantum hardness of the lattice problems GapSVP_\gamma (for \gamma as above) and SIVP_{t} with t=Õ(n/\alpha). In 2009, Peikert[2] proved a similar result assuming only the classical hardness of the related problem GapSVP_{\zeta,\gamma}. The disadvantage of Peikert's result is that it bases itself on a non-standard version of an easier (when compared to SIVP) problem GapSVP.

Public-key cryptosystem

Regev[1] proposed a public-key cryptosystem based on the hardness of the LWE problem. The cryptosystem as well as the proof of security and correctness are completely classical. The system is characterized by m,q and a probability distribution \chi on \mathbb{T}. The setting of the parameters used in proofs of correctness and security is

The cryptosystem is then defined by:

The proof of correctness follows from choice of parameters and some probability analysis. The proof of security is by reduction to the decision version of LWE: an algorithm for distinguishing between encryptions (with above parameters) of 0 and 1 can be used to distinguish between A_{s,\chi} and the uniform distribution over \mathbb{Z}^n_q \times \mathbb{Z}_q

CCA-secure cryptosystem

Peikert[2] proposed a system that is secure even against any chosen-ciphertext attack.

See also

References

  1. ^ a b c d e f Oded Regev, “On lattices, learning with errors, random linear codes, and cryptography,” in Proceedings of the thirty-seventh annual ACM symposium on Theory of computing (Baltimore, MD, USA: ACM, 2005), 84-93, http://portal.acm.org/citation.cfm?id=1060590.1060603.
  2. ^ a b c d e Chris Peikert, “Public-key cryptosystems from the worst-case shortest vector problem: extended abstract,” in Proceedings of the 41st annual ACM symposium on Theory of computing (Bethesda, MD, USA: ACM, 2009), 333-342, http://portal.acm.org/citation.cfm?id=1536414.1536461.
  3. ^ Chris Peikert and Brent Waters, “Lossy trapdoor functions and their applications,” in Proceedings of the 40th annual ACM symposium on Theory of computing (Victoria, British Columbia, Canada: ACM, 2008), 187-196, http://portal.acm.org/citation.cfm?id=1374406.
  4. ^ Craig Gentry, Chris Peikert, and Vinod Vaikuntanathan, “Trapdoors for hard lattices and new cryptographic constructions,” in Proceedings of the 40th annual ACM symposium on Theory of computing (Victoria, British Columbia, Canada: ACM, 2008), 197-206, http://portal.acm.org/citation.cfm?id=1374407.